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Abstract 

Weighted model counting (WMC) is a well-known inference task on 
knowledge bases, used for probabilistic inference in graphical models. We 
introduce algebraic model counting (AMC), a generalization of WMC to 
a semiring structure. We show that AMC generalizes many well-known 
tasks in a variety of domains such as probabilistic inference, soft con- 
straints and network and database analysis. Furthermore, we investigate 
AMC from a knowledge compilation perspective and show that all AMC 
tasks can be evaluated using sd-DNNF circuits. We identify further char- 
acteristics of AMC instances that allow for the use of even more succinct 
circuits. 



1 Introduction 



Today, some of the most efficient techniques for probabilistic inference employ 
reductions to weighted model counting (WMC) both for propositional and for 



relational probabilistic models (Park 2002 Sang et al. 2005 Darwiche 2009 



Fierens et al. 20111. On the other hand, it is well-known that probabilistic 



inference as well as many other tasks can be generalized to a sum of products 
computation over models with suitable operators from a semiring structure. 
This has led to common inference algorithms for a variety of different inference 



problems in a variety of different fields, see for instance 


Goodman 


(1999), 


Eisner 


et al. (20051, I 


vfeseguer et al. (20061, Green et al. (20071, Bacchus et al. 


2009), 


Larrosa et al. 


(2010), Baras and Theodorakopoulos ( 


2010) and 


iimmig et al. 


(2011 ). The work presented here builds on these two lines of work. 



As our first contribution, we introduce the task of algebraic model counting 
(AMC). AMC generalizes weighted model counting to the semiring setting and 
supports various types of labels, including numerical ones as used in WMC, 
but also sets, polynomials, Boolean formulae, and many more. It thereby also 
generalizes many different tasks from a variety of different fields. As our sec- 
ond contribution, we investigate how to solve AMC problems using knowledge 



compilation. As AMC is defined in terms of the set of models of a preposi- 
tional logic theory, we can exploit the succinctness results of the knowledge 
compilation map of Darwiche and Marquis (2002). We show that AMC can in 
general be evaluated using sd-DNNF circuits, which are more succinct than a 
direct representation of the set of models. Furthermore, we identify a number 
of characteristics of AMC tasks that allow for using even more succinct types of 
circuits. Our results generalize well-known results for satisfiability and model 
counting in circuits to broad classes of AMC tasks and extend the task classifica- 



tion in algebraic Prolog (Kimmig et al. 2011 ) to more succinct types of circuits 



As our third contribution, we provide conditions under which AMC generalizes 
semiring sums of products defined over derivations, that is, sequences of possibly 
repeated variables, instead of over models. 

This paper is organized as follows. We introduce algebraic model counting in 
Section[2j Section|3]provides task characteristics that allow for sound evaluation 
on specific classes of circuits and shows how these generalize previous results. 
We discuss future work and conclude in Section |4j 



2 Algebraic Model Counting 

We first define the task of algebraic model counting, provide some example 
instances, and briefly discuss its relationships to existing frameworks. The un- 
derlying mathematical structure is that of a commutative semiring. 

Definition 1 ((Commutative) Semiring). A semiring is a structure 
(A, ©, ®, e® , e®), where addition © and multiplication £§> are associative binary 
operations over the set A, © is commutative, <g> distributes over ©, e® £ A is the 
neutral element of®, e® £ A that of®, and for all a £ A, e® ®a = a<X>e® = e®. 
In a commutative semiring, ® is commutative as well. 

Algebraic model counting is now defined as follows: 

Definition 2 (AMC Problem). Given 

• a propositional logic theory T over a set of variables V, 

• a commutative semiring [A, ©, (8>, e®, e®), and 

• a labeling function a : C — > A, mapping literals C of the variables in V to 
elements of the semiring set A, 

compute 

MT)= (g)a(Z), (1) 

ieM(T) lei 

where A4(T) denotes the set of models of T. 
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Theorem 1. AMC generalizes satisfiability (SAT), model counting (#SAT), 
weighted model counting (WMC), probabilistic inference (PROB), sensitivity 
analysis (SENS), gradient (GRAD), probability of most likely states (MPE), 
shortest (S-PATH) and widest (W-PATH) paths, fuzzy (FUZZY) and It- 
weighted (k WEIGHT) constraints, and 0BDD < construction. 

Proof. By verification of the definitions in Table [T] □ 

SAT, #SAT, WMC and PROB are well-known tasks that appear in many 
fields. SENS and GRAD allow one to investigate the effect of parameter 
changes and to learn parameters in a probabilistic setting, respectively. In 
GRAD, labels are tuples (pi,gi) with pi £ [0, 1] the probability of Vi and gt the 
gradient with respect to the kth variable: 

«<*>={ fcif Vi7i (2) 

-(-"-{to" VrA p> 

(01,03) © (61,62) = (ai + &i,02 + 62) (4) 
(01,02) <8 (61, 6 2 ) = (ai • 61, a\ -b 2 + a 2 - 61) (5) 

If the second element of the label denotes a cost, the GRAD semiring calculates 
expected costs. MPE, S-PATH, W-PATH, FUZZY and fc WEIGHT (with 
bounded addition + k ) are examples of optimization tasks. Finally, AMC can 
also be used to construct a canonical representation of the set of models in the 
form of an DBDD < circuit, a popular data structure in many fields of computer 
science. The last two tasks in the table originate from probabilistic databases 
under the positive relational algebra 1ZA + and are not easily extended to nega- 
tive literals. Why-provenance (WHY) collects the set of identifiers of all tuples 
an answer depends on, whereas 1ZA + -provenance constructs polynomials that 
also take into account the number of times the tuples are used. While all tasks 
listed in Table [l] are representative examples from the literature, cf. the refer- 
ences given in the table, this is by no means an exhaustive list of semirings and 
labeling functions that can be used for AMC. 

As these examples illustrate, the AMC task shares its basic structure with a 
number of other tasks. The class of sum-of-products problems generalizes factor 
graphs to the algebraic setting, but uses discrete valued variables or factors as 



basic building bloc ks (|Bacchus et al. 20091. In this context, affine algebraic 



decision diagrams panner and McAllester 2005) and AND/OR multi- valued 



decision diagrams flMateescu et al. . 2008 ) have been used for inference with 



real-valued semirings. The restriction to two-valued variables allows us to di- 
rectly compile AMC tasks to propositional circuits without adding constraints 
on legal variable assignments to the theory. In soft constraint programming, 
additional constraints are imposed on the semiring, which ensure that addition 



optimizes the degree of constraint satisfaction (Meseguer et al. 2006). Wil- 



son ( 2005 1 provides an algorithm that compiles semiring-based systems into 
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semiring-labelled decision diagrams, which are closely related to unordered bi- 
nary decision diagrams, to compute valuations. Semiring-induced prepositional 
logic labels clauses with semiring elements with a weight associated to their falsi- 



fication and is restricted to semirings whose induced pre-order is partial ( Larrosa 



et al. 2010). In algebraic Prolog (aProbLog), a semiring-labeled logic program 



is reduced to AMC for inference (Kimmig et al. 2011) 



While AMC sums over models, other tasks sum over sequences of pos- 
sibly repeated variables. Examples include algebraic path problems (Baras 
and Theodorakopoulos 2010), semiring parsing (Goodman 1999[), prov enance 
semirings for positive relational algebra queries in databases ( Green et al. 2007 ), 



and semiring- weighted dynamic programs (Eisner et al. , 2005 ) . We will discuss 



the difference between such derivation-based settings and AMC in more detail 
in Section [331 



3 AMC using Knowledge Compilation 



In their knowledge compilation map, Darwiche and Marquis| ( |2002[ ) provide an 



overview of succinctness relationships between various types of propositional 
circuits. Furthermore, they show which reasoning tasks in propositional logic, 
such as (weighted) model counting (#SAT/WMC) or satisfiability checking 
(SAT), are evaluated on which circuits in time polynomial in the size of the 
circuit. Propositional circuits are often used as a representation language in 
weighted model counting and similar tasks, including for instance probability 



calculation and sensitivity analysis in probabilistic databases (Jha and Suciu 



2011 



Kanagal et al. 20111 and inference in algebraic Prolog (Kimmig et al 
20111. In the following, we extend this approach to AMC. We first repeat 
the relevant knowledge compilation concepts, closely following |Darwiche and| 
Marquis| ( [2002| . 



Definition 3 (NNF). A sentence in negation normal form (NNF) over a set of 
propositional variables V is a rooted, directed acyclic graph where each leaf 
node is labeled with true (T), false (_L), or a literal of a variable in V, and each 
internal node with disjunction (V) or conjunction (A). 

Definition 4 (Decomposability). An NNF is decomposable if for each conjunc- 
tion node A"=i 4>h n0 two children fa and fa share any variable. 

Definition 5 (Determinism). An NNF is deterministic if for each disjunction 
node VT=i 4>ii each pair of different children fa and fa: is logically inconsistent. 

Definition 6 (Smoothness). An NNF is smooth if for each disjunction node 
V"=i eacn child fa mentions the same set of variables. 

DNNF, d-NNF, s-NNF, sd-NNF, d-DNNF, s-DNNF, and sd-DNNF arc the sub- 
sets of NNF satisfying (combinations of) these properties, where D stands for 
decomposable, d for deterministic, and s for smooth. For instance, the circuit 



in Figure la is in sd-DNNF, while the one in Figure lb has none of the three 
properties. 
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(a) sd-DNNF 



(b) NNF 



Figure 1: Example of an sd-DNNF and NNF circuit. 
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Table 2: Semiring characteristics and corresponding circuits that allow for sound 
AMC evaluation. 



The algebraic model count A(T) is defined as a summation over the set 
of models M.(T) of a prepositional theory T, which corresponds to the MODS 
language in the knowledge compilation map. However, as this MODS language is 
exponentially less succinct than any other representation of T included in the 
map, converting to MODS in order to evaluate Equation ([lj directly is undesirable. 
In the following, we therefore establish a connection between characteristics of 
AMC tasks and properties of the NNF circuits they can be evaluated on, resulting 
in the classification scheme summarized in Table [2j 

The key idea underlying NNF evaluation is to perform a bottom-up pass 
over the circuit, labeling each node with the value of the subcircuit rooted at 
that node. For disjunction nodes, the values of all their children are combined 
using ffi, for conjunction nodes using (g>. 

Definition 7 (NNF Evaluation) . The function Eval specified in Algorithm [l] 
evaluates an NNF circuit for a commutative semiring (A, ©, <8>, e® , e®) and label- 
ing function a. 

Consider for example #SAT for the two circuits in Figure [TJ which both 
represent an exclusive OR of two variables. Evaluation of the sd-DNNF in Fig- 
ure la which in fact is a MODS representation, assigns label 1 to each leaf, 1-1 = 1 
to each conjunction node, and 1 + 1 = 2 to the disjunction node at the root 
and thus the entire circuit, which is correct. On the other hand, evaluation on 
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Algorithm 1 Evaluating an NNF circuit N for a commutative semiring 
(A, ffi, ®, e® , e®) and labeling function a. 

I: function Eval(7V, ©, ®, e® , e®, a) 

2: if A is a true node T then return e® 

3: if TV is a false node _L then return e® 

4: if A is a literal node I then return a(Z) 

5: if A is a disjunction VfcLi then 

6: return 0™ x EvAL(A i; ®, ®, e® , e® , a) 

7: if A is a conjunction Al!li then 

8: return (g),™ 1 Eval(A;, ©, ®, e® , e® , a) 



the general NNF in Figure lb assigns 1 + 1 = 2 to each disjunction node and 
2-2 = 4 to the conjunction node at the root. This overestimation is due to mod- 
els shared by the children of the same disjunction node and variables shared by 
the children of the conjunction node, as we will see in more detail in Section [372] 
and 13.31 



Definition 8 (Soundness). Evaluating an NNF representation Nt of a prepo- 
sitional theory T for a semiring [A, ©, <8>, e® , e®) and labeling function a is a 
sound AMC computation iff Eval(7V t , ©, <£>, e® , e® , a) = A(T). 

In the following, we establish a general soundness result for AMC evaluation 
on sd-DNNF circuits as well as properties of AMC tasks that guarantee soundness 
for various other subclasses of NNF. Given soundness, we inherit the polynomial 



complexity results of the knowledge compilation map (Darwiche and Marquis 



2002 1 for semiring operators with unit cost. Note however that there are semir- 
ings with more expensive operators. For instance, labels in DBDD< may grow 
exponentially in the circuit size. 

3.1 sd-DNNF Evaluation 

We show that AMC evaluation is sound on sd-DNNF circuits. As these are 
strictly more succinct than MODS representations, they allow for more efficient 
inference. 

Theorem 2 (sd-DNNF Evaluation). Evaluating an sd-DNNF representation of 
the propositional theory T is a sound AMC computation. 

Proof. We show that Eval(A t , ©, <8>, e® , e®, a) for an sd-DNNF representation 
Nt of the theory T computes A(T) with respect to all variables in Nt- 

1. Line 2: A(T) = /e{0} <g) i£/ a(l) = e® 

2. Line 3: A(_L) = /e{} ® JeJ a(l) = e® 

3. Line 4: A(Z) = /e{{;}} <g> fee/ a(k) = a(l) 
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Due to associativity and commutativity of the semiring operators, operands of 
each summation and each multiplication can be evaluated in arbitrary order. 
We therefore restrict ourselves to binary disjunction and conjunction nodes here. 
Given sound evaluation for subcircuits fa with variables Vi and models Mi with 
respect to these variables, we obtain: 

4. Lines 5-6: Disjunction node fa V fa: As V = Vi U V 2 = Vi due to smooth- 
ness, we obtain M(fa V fa) — Mi U M2, which is a disjoint union due 
to determinism. Therefore, A{fa) (B A(fa) = i=1 2 ©a-1; ®iei a (0 = 
® Ml uM 2 <8W a ( l ) = M^i V fa). 

5. Lines 7-8: Conjunction node fa A fa: As Vi n V 2 = due to decompos- 
ability, the set M(fa A fa) of models of the conjunction is simply the 
set of all unions of models of its parts. Together with distributivity, we 
get A(fa) A(fa) = i=12 ©_ Mi 0, eJ a(l) = ® M{ ^ 2) ® leI a(l) = 
A(fa A fa). 

□ 

Clearly, the soundness of AMC evaluation on sd-DNNF depends on all three 
properties of this subclass of NNF. On the other hand, circuits without these 
properties may be exponentially smaller and thus allow for more efficient infer- 
ence. In the following, we therefore analyze evaluation in the absence of these 
properties, which allows us to identify characteristics of the semiring and la- 
beling function that ensure sound evaluation on the corresponding classes of 
circuits. 



3.2 Evaluation on other Decomposable Circuits 

If a circuit is not deterministic, children of a disjunction node may have common 
models, in which case evaluation sums over such shared models multiple times. 



For instance, consider the circuit in Figure lb with PROB, a(a) = 0.6 and 
a(b) = 0.3. Evaluation on this circuit results in 0.6 + 0.3 = 0.9 for the right 
disjunction node, while A(a V b) = 0.6 • 0.3 + 0.4 • 0.3 + 0.6 • 0.7 = 0.72. 

Definition 9 (Idempotent Operator). A binary operator over a set A is 
idempotent iff Va € A : a a = a. 

Theorem 3 (s-DNNF Evaluation). Evaluating an s-DNNF representation of the 
propositional theory T for a semiring with idempotent is a sound AMC com- 
putation. 

Proof. Reconsider point (4) of the proof of Theorem [2j With smoothness, 
but without determinism, A^i(0i) U Ai2{fa) is no longer a union of disjoint 
sets, and A(4>i) A(0 2 ) = © i= i 2 ©m ®fe/ a (0 sums over the models in 
M\(fa) r\M2(fa) twice. Due to associativity and commutativity, this is sound 
for idempotent 0. □ 
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If a circuit is not smooth, the children of a disjunction node may use different 
sets of variables. Each model of a child node corresponds to a set of models 
for the full set of variables, but evaluation on a non-smooth circuit ignores the 
labels of unmentioned variables. For instance, consider the circuit in Figure [Tb| 
with MPE, a(a) — 0.6 and a(b) — 0.3. Evaluating the right disjunction node 
of this circuit results in max(0.6, 0.3) = 0.6, while A(a V b) = max(0.6 ■ 0.3, 0.4 • 
0.3,0.6-0.7) = 0.42. 

Definition 10 (Neutral (ffi,a)). A semiring addition and labeling function 
pair (®, a) is neutral iff Vw G V : a(v) ® a(-iv) = e®. 

Theorem 4 (d-DNNF Evaluation). Evaluating a d-DNNF representation of the 
propositional theory T for a semiring and labeling function with neutral (®,a) 
is a sound AMC computation. 

Proof. For lines 5-6 (point (4) of the proof of Theorem [2| to be sound, the sum 
of the AMCs computed by the children over their sets of variables Vt has to be 
equal to the AMC of the entire disjunction over the full set of variables V\ U V%. 
Given the child AMC Ay 4 (^), adding a variable v to Vi replaces each model I 
of (f>i by two models I + = I U {v} and I~ = I U {^v}. Due to distributivity, 
commutativity and the neutral sum property, the algebraic sum of these two 
models equals the AMC of the original model: 

A.y 4 u{»}C0 = Ay 4UW (J + ) © A v . u{v} (I-) 

= (a(v) © a(-*v)) ® a(l) 
lei 

= ®a(0 = A Vi (i) 
lei 

Evaluation therefore computes A Vl (4>i) © Ay 2 [<f> 2 ) = A Vl uv 2 (<^i) © A Vl uv 2 (^2), 
which due to determinism is equal to Ay 1 uv 2 (0i V (j>2). □ 

Note that from a practical point of view, non- neutral (©,a) does not in- 
fluence the tractability of inference, as any NNF can be smoothed in polytime 



preserving determinism and decomposability (Darwiche and Marquis 2002). 

The previous two results can directly be combined for DNNF circuits that are 
neither smooth nor deterministic. 

Theorem 5 (DNNF Evaluation). Evaluating a DNNF representation of the propo- 
sitional theory T for a semiring and labeling function with idempotent and neu- 
tral (ffi, a) is a sound AMC computation. 

Proof. Reconsider point (4) of the proof of Theorem [2] Due to neutral (©, a), 
values for all children of a disjunction node are correct with respect to the full 
set of variables (Theorem EJ). Due to idempotent ©, multiple occurrences of the 
same model do not influence the result (Theorem |3| . □ 

This completes the left part of Table [2] where no conditions are imposed on 
semiring multiplication. 
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3.3 Evaluation on Non-Decomposable Circuits 



If a circuit is not decomposable, the children of a conjunction node may share 
variables. In this case, simply combining all pairs of their models may produce 
sets of literals that are not models, because they either contain contradicting 
literals, or several copies of the same literal, which results in erroneous extra 
multiplications. For instance, in Figure [Tb] {^a, 6} is a model of both disjunction 
nodes, and {a, b} of the right one only. The conjunction node sums among others 
the products ct(-ia) (g a(b) (g a(a) (8) a (6), which does not correspond to a model, 
and a(->a) (g a(b) (g) a(->a) <g a(b), where labels of both literals are multiplied 
twice. 

Definition 11 (Consistency-Preserving ((g), a)). A semiring multiplication and 
labeling function pair (<g,a) is consistency-preserving iffVv £ V : a(i>)<ga(^v) = 
e®. 

Theorem 6 (sd-NNF Evaluation). Evaluating an sd-NNF representation of the 
propositional theory T for a semiring and labeling function with idempotent and 
consistency-preserving ((g), a) is a sound AMC computation. 

Proof. Reconsider point (5) of the proof of Theorem [2j The set of models 
of <f>i A 02 contains exactly all pairwise combinations of models of its parts 
that agree on all shared variables. The circuit evaluates the AMC of <j>i A 4>2 
as A((/>i) (g A((/>2), which due to distributivity is the sum over all pairwise 
combinations of models. Without decomposability, each such combination / 
contains two literals for each v E Vi n V2. As (g is associative, commutative 
and idempotent, repeated occurrences of a literal I in ® it£ ia(i) do not affect the 
result. If {1,^1} C /, ® i( zia{i) includes a multiplication by a(l) <g> a(^l) = e®, 
which in a semiring means (g)igja(i) = e®. Such inconsistent / thus do not 
contribute to the semiring sum. □ 

Theorem [6] affects only conjunction nodes, whereas Theorems [3j [2] and [5] 
only affect disjunction nodes. Their combination thus extends our results to 
non-decomposable circuits that do not satisfy (one of) the other two properties 
either: 

Theorem 7 (s-NNF, d-NNF, and NNF Evaluations). For a semiring and label- 
ing function with idempotent and consistency preserving ((g), a), evaluating the 
following representation of the propositional theory T is a sound AMC compu- 
tation: 

• s-NNF if © is idempotent 

• d-NNF if (©, a) is neutral 

• NNF i/(©,a) is idempotent and neutral 

This completes the right part of Table [2j where using non-decomposable 
circuits is possible as a consequence of restrictions on semiring multiplication. 
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Given a new AMC instance, this table allows one to immediately choose the 
appropriate type of circuit for efficient evaluation. For the examples discussed 
here, cf. Table^J these are: NNF for 0BDD<, DNNF for SAT, S-PATH, W-PATH 
and FUZZY, d-DNNF for PROB, SENS and GRAD, s-DNNF for MPE and 
fc WEIGHT, and sd-DNNF for #SAT and WMC. 



3.4 Discussion 



The results summarized in Table [2] generalize the complexity results for SAT 
and #SAT, provide more succinct types of circuits for inference in algebraic 
Prolog, and show that all circuits that are practically relevant for AMC are 
well-studied in the knowledge compilation map. We now address these points 
in more detail. 
First, 



Darwiche and Marquis (20021 show that SAT can be evaluated in 



polynomial time on DNNF, while #SAT can be evaluated in polynomial time on 
d-DNNF, as smoothing is possible in polynomial time. Our Theorem [5] general- 
izes soundness of DNNF evaluation to all semirings and labeling functions with 
idempotent and neutral (©, a), which includes SAT, while our Theorem [2] gen- 
eralizes soundness of sd-DNNF evaluation to all semirings and labeling functions, 
including those with non- idempotent, non-neutral (®,a), such as #SAT. As 
discussed above, the complexity of evaluation is polynomial if each semiring 
operation has unit cost. 



Second, [Kimmig et al. (2011 ) reduce inference in algebraic Prolog (aProbLog) 
to AMC evaluation on disjunctive normal form (DNF). For non-idempotent ad- 
dition, the DNF is compiled into an ordered binary decision diagram (OBDD) if 
its conjunctions are not mutually exclusive. For non-neutral (©, a), circuits are 
smoothed before evaluation. This results in the following settings: 





neutral (©, a) 


non- neutral (©, a) 


idempotent 


DNF 


s-DNF 


non-idempotent © 


d-DNF 
OBDD 


sd-DNF 
s-OBDD 



Table [2] uses the same characteristics of semiring operators and labeling func- 
tion, but does not assume a DNF as starting point and thus also does not rely 
on properties of such a DNF. Its left half generalizes the aProbLog scheme to 
more succinct superclasses, namely (s-)DNNF instead of (s-)DNF, and (s)d-DNNF 
instead of (s)d-DNF or (s-)OBDD. 

Third, we observe that while there are interesting inference tasks that allow 
for sound evaluation on the more succinct class of DNNF instead of the general 
sd-DNNF evaluation, the conditions for sound evaluation on non-decomposable 
circuits are too strict in most practical cases. This is in line with the knowledge 
compilation map, which excludes non-decomposable circuits (with the exception 
of the most general class NNF) as they do not support any of the studied tasks 



in polytime (Darwiche and Marquis 2002). 
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(a) Path problem 



(b) Semiring grammar 



Figure 2: NNF circuits for an algebraic path problem and a semiring grammar. 
See Section 13.51 for details. 



3.5 AMC and Algebraic Derivation Counting 

While AMC is a sum over models, many other semiring-based tasks require 
a sum over derivations, that is, sequences of possibly repeated variables. Ex- 



amples include algebraic path problems (Baras and Theodorakopoulos 2010) 



semiring parsing (Goodman 1999), provenance semirings for positive relational 
algebra queries in databases (Green et al. 2007), and semiring- weighted dy- 



namic programs (Eisner et al. 2005). We refer to this type of task as algebraic 
derivation counting (ADC). While AMC and ADC appear very similar, they 
cannot easily be exchanged, as we illustrate next. We restrict the discussion to 
finite ADC. 

Where AMC is based on a MODS representation, that is, a smooth, determin- 
istic DNF, ADC is based on a type of NNF that is not included in the knowledge 
compilation map: a disjunction of conjunctions of possibly repeated variables. 
Because of the repeated variables, this is not a DNF in general. Figure [2] shows 
two examples of such circuits. The NNF in Figure [2a] encodes the two paths 
between nodes s and t in a graph with three edges (s,t), {s,r) and (r, t). 
It is decomposable, as the conjunction does not repeat variables, but neither 
smooth nor deterministic and thus a DNNF. The NNF in Figure [2b] encodes the 
two derivations of aa in a context-free grammar with rules S — » aS \ AA | e and 
A — > AA | a. Variable Xi denotes an application of the ith production rule for a 
non-terminal X, that is, the right conjunction in the circuit encodes the deriva- 
tion S — > AA — > aA — > aa. The circuit is neither decomposable nor smooth, and 
also not deterministic, even though the derivations in the underlying grammar 
setting are mutually exclusive (cf. below). 

In order to represent an ADC task as an AMC task, the labeling function a 
needs to be extended to negative literals. Together with the semiring, such an 
extension determines one of the settings of Table [2] If the ADC's NNF belongs 
to the class of circuits corresponding to this setting, the ADC and AMC tasks 
coincide. Clearly, this is always the case for commutative semirings and label- 
ing functions with idempotent, consistency-preserving ((g), a) and idempotent, 
neutral (0, a) such as for instance 0BDD<. However, as noted above, such tasks 
are rare. 
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For an ADC instance with idempotent addition defined on a DNNF, that is, 
without repeated variables in conjunctions, an equivalent AMC instance can 
be constructed if a can be extended such that (©, a) is neutral. For instance, 
consider the circuit in Figure 2a in a shortest path setting. By setting a(->v) = 
for all variables v, the S-PATH semiring in Table [T] provides an equivalent AMC 
task. 

As the NNF circuits of ADC do not contain negation, they are not deter- 
ministic. However, the underlying tasks often impose additional constraints on 
their variables. For instance, in the grammar example in Figure [2b} the leftmost 
children si and s2 of the two conjunction nodes are in fact mutually exclusive, 
as only one of the right hand sides for S can be chosen as first step in a deriva- 
tion. Such an n-ary variable can be encoded using n binary variables by adding 
corresponding constraints to the theory that restrict legal value assignments. 
For some semirings, for instance PROB, it is sufficient to adapt the labels of 
these variables without adding constraints. It is an open question under which 
general conditions such label transformations are possible. 

Conversely, every AMC task can be trivially represented as an ADC task by 
introducing a derivation for each model. However, this is clearly not desirable 



from a complexity point of view. Baras and Theodorakopoulos (2010) provide 



an ADC encoding of network reliability under probabilistic edge failure, that 
is, the PROB AMC task for a positive propositional formula. They essen- 
tially modify multiplication to filter repeated literals, and addition to subtract 
shared models of its operands, which drastically increases complexity of these 
operations. Again, it is an open question under which general conditions such 
transformations are possible. 



4 Conclusions and Future Work 

We have introduced the task of algebraic model counting, which generalizes 
weighted model counting to a semiring setting and thus to various types of la- 
bels, including numerical ones as used in WMC, but also sets, polynomials, or 
Boolean formulae. We have shown that evaluating AMC is sound on sd-DNNF 
circuits, which are known to be more succinct than the MODS language used in 
its definition. Furthermore, we have provided characteristics of AMC tasks that 
guarantee sound evaluation on more succinct classes of circuits. This classifica- 
tion not only provides a means of directly choosing a circuit type that allows 
for efficient inference given a new AMC task, but also generalizes a number of 
known results and provides a framework to map restricted types of algebraic 
derivation counts onto AMC tasks. 

Given the results presented here, it is worth investigating which other al- 
gebraic representations can be reduced to algebraic model counting. Another 
line of future work concerns the introduction of additional operators that would 
make it possible to express additional tasks, for instance, partial MAP, which 
requires a maximization operator in addition to summation and multiplication. 
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